Multilevel Annotation of Agreement and Disagreement in Italian News Blogs

Authors

  • Fabio Celli
  • Giuseppe Riccardi
  • Firoj Alam
Abstract

In this paper, we present a corpus of news blog conversations in Italian annotated with gold standard agreement/disagreement relations (ADRs) at the message and sentence levels. This is the first resource of its kind in Italian. Analysis of ADRs at the two levels shows that agreement annotated at message level is consistent and generally reflected at sentence level, whereas the structure of disagreement is more complex. The manual error analysis revealed that this resource is useful not only for the analysis of argumentation, but also for the detection of irony/sarcasm in online debates. The corpus and annotation tool are available for research purposes on request.

Similar resources

CorEA: Italian News Corpus with Emotions and Agreement

English. In this paper, we describe an Italian corpus of news blogs, including bloggers’ emotion tags, and annotations of agreement relations amongst blogger-comment pairs. The main contributions of this work are: the formalization of the agreement relation, the design of guidelines for its annotation, the quantitative analysis of the annotators’ agreement. Italiano. In questo articolo descrivia...

Anaphoric Annotation of Wikipedia and Blogs in the Live Memories Corpus

The Live Memories corpus is an Italian corpus annotated for anaphoric relations. The corpus includes manually annotated information about morphosyntactic agreement, anaphoricity, and semantic class of the NPs. For the annotation of the anaphoric links, the corpus takes into account specific phenomena of the Italian language, such as incorporated clitics and phonetically non-realized pronouns. The Live...

Annotating Sentiment and Irony in the Online Italian Political Debate on #labuonascuola

In this paper we present the TWitterBuonaScuola corpus (TW-BS), a novel Italian linguistic resource for Sentiment Analysis, developed with the main aim of analyzing the online debate on the controversial Italian political reform “Buona Scuola” (Good school), aimed at reorganizing the national educational and training systems. We describe the methodologies applied in the collection and annotatio...

Crowdsourcing Ambiguity-Aware Ground Truth

The process of gathering ground truth data through human annotation is a major bottleneck in the use of information extraction methods. Crowdsourcing-based approaches are gaining popularity in the attempt to solve the issues related to volume of data and lack of annotators. Typically these practices use inter-annotator agreement as a measure of quality. However, this assumption often creates is...

Ethical Agreement and Disagreement about Obesity Prevention Policy in the United States

An active area of public health policy in the United States is policy meant to promote healthy eating, reduce overconsumption of food, and prevent overweight/obesity. Public discussion of such obesity prevention policies includes intense ethical disagreement. We suggest that some ethical disagreements about obesity prevention policies can be seen as rooted in a common concern with equality or w...

Publication date: 2016